Beta Distribution

Characterization

PDF:

f(x;α,β)=1B(α,β)xα1(1x)β1

B(α,β) is called a Beta function, and is just a normalization constant.
B(α,β)=10tα1(1t)β1dt=Γ(α)Γ(β)Γ(α+β)

Γ(x) is a gamma function.
Its general CDF function does not have an analytic expression.

Applications

Beta Distribution is the conjugate prior of many discrete distributions, including the Bernoulli and binomial distribution, in Bayesian inference.
The parameters, α and β, can be intuitively interpreted as positive and negative "psudo samples". The posterior hyperparameters of Bernoulli likelihood are simply α+ni=1xi,β+nni=1xi.

Properties

mode

Mode, in statistics, is the value that appears most often in a set of data, corresponding to the peak in the PDF. The anti-mode is the lowest point of the probability density curve.

The mode of a Beta distributed random variable X with α, β > 1 (and anti-mode with α, β < 1), is

α1α+β2

mean

μ=E[X]=αα+β

Reference

Beta Distribution: https://en.wikipedia.org/wiki/Beta_distribution


The followings are the illustrations of the hyperparameters.
ex1.png
ex2.png
Note that y axis stands for probability density, which can exceed 1 with no surprise (probability is the integral of probability density).